• Article  

      Characterizing crawler behavior from web server access logs 

      Dikaiakos, Marios D.; Stassopoulou, Athena; Papageorgiou, Loizos (2003)
      In this paper, we present a study of crawler behavior based on Web-server access logs. To this end, we use logs from five different academic sites in three countries. Based on these logs, we analyze the activity of different ...
    • Conference Object  

      Distributed location aware web crawling 

      Papapetrou, Odysseas; Samaras, George S. (2004)
      Distributed crawling has shown that it can overcome important limitations of the today's crawling paradigm. However, the optimal benefits of this approach are usually limited to the sites hosting the crawler. In this work, ...
    • Article  

      An investigation of web crawler behavior: Characterization and metrics 

      Dikaiakos, Marios D.; Stassopoulou, Athena; Papageorgiou, Loizos (2005)
      In this paper, we present a characterization study of search-engine crawlers. For the purposes of our work, we use Web-server access logs from five academic sites in three different countries. Based on these logs, we analyze ...
    • Conference Object  

      ViSMA: Extendible, mobile-agent based services for the materialization and maintenance of personalized and shareable Web views 

      Samaras, George S.; Karenos, K.; Chrysanthis, Panos K.; Pitoura, Evaggelia 1967- (Institute of Electrical and Electronics Engineers Inc., 2003)
      ViSMA (Views Supported by Mobile Agents) is a prototype set of extendible mobile-agent based services that allow the definition, materialization, maintenance and sharing of views created over remote Web-accessible databases. ...
    • Article  

      Web robot detection: A probabilistic reasoning approach 

      Stassopoulou, Athena; Dikaiakos, Marios D. (2009)
      In this paper, we introduce a probabilistic modeling approach for addressing the problem of Web robot detection from Web-server access logs. More specifically, we construct a Bayesian network that classifies automatically ...